Trading activity as driven Poisson process: 
comparison with empirical data 
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We propose the point process model as the Poissonian-hke stochastic sequence with 
' slowly diffusing mean rate and adjust the parameters of the model to the empirical 

^ . data of trading activity for 26 stocks traded on NYSE. The proposed scaled stochas- 

tic differential equation provides the universal description of the trading activities 
with the same parameters applicable for all stocks. 
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! 1 Introduction 

o 



^ . A possible way to avoid the inconsistence of geometric Brownian motion as a 



model of speculative markets is the assumption that the volatility is itself a 
time-depending random variable. Within this assumption there exist the dis- 
crete ARCH and GARCH models [T|2|3E] as well as the continuous stochastic 
volatility models [5ll6f7] . It is empirically established that the autocorrelation 
function of volatility decays very slowly with time exhibiting two different ex- 
ponents of the power-law distribution: for short time scales and for the long 
time ones. However, common stochastic volatility models are able reproduce 
only one time scale exponential decay [6l|8] . There is empirical evidence that 
the trading activity is a stochastic variable with the power-law probability 
distribution function (PDF) ^9)il0j and the long-range correlations [^111112] 
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resembling the power-law statistical properties of volatility. Empirical analy- 
sis confirms that the long-range correlations in volatility arise due to those 
of the trading activity [TT]. On the other hand, the trading activity can be 
modeled as the event flow of the stochastic point process with more evident 
microscopic interpretation of the observed power-law statistics. 

Recently, we proposed the stochastic model of the trading activity in the fi- 
nancial markets as Poissonian-like process driven by the stochastic differential 
equation (SDE) [TH|IT^ . Here we present the detailed comparison of the model 
with the empirical data of the trading activity for 26 stocks traded on NYSE. 
This enables us to present a more precise model definition based on the scaled 
equation, universal for all stocks. The proposed form of the difference equation 
for the intertrade time can be interpreted as a discrete iterative description of 
the proposed model, based on SDE. 



2 Stochastic model of trading activity 

We consider trades in the financial market as identical point events. Such point 
process is stochastic and defined by the stochastic interevent time = t^+i — 
tk, with tk being the occurrence times of the events. Recently we proposed 
p!3p^ to model the flow of trades in the financial markets as Poissonian-like 
process driven by the multiphcative stochastic equation, i.e. we define the rate 
n = 1/r of this process by the continuous stochastic differential equation 

dn = a' 1-7, + p at + ——-dW. 1 

2 \ nj ] [ne + 1)^ [ne + 1) 



This SDE with the Wiener noise W describes the diffusion of the stochas- 
tic rate n restricted in some area: from the side of the low values by the 
term m(no/n)™/2 and from the side of high values by the relaxation 70-. The 
general relaxation factor a'^n^/{ne + l)^dt is keyed with multiplicative noise 
arv'/'^ /{ne + l)dW^ to ensure the power-law distribution of n. The multiplica- 
tive noise is combined of two powers to ensure the spectral density of n with 
two power law exponents. This form of the multiplicative noise helps us to 
model the empirical probability distribution of the trading activity, as well. 
A parameter e defines the crossover between two areas of n diffusion. For 
more details see p^p^ . Equation ([1]) has to model stochastic rate n with two 
power- law statistics, i.e., PDF and power spectral density or autocorrelation, 
resembling the empirical data of the trading activity in the financial markets. 
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We will analyze the statistical properties of the trading activity N(t, Xd) de- 
fined as integral of n in the selected time window tj, 



iV(t,rd)= / n{t')dt'. (2) 



t 



The Poissonian-like sequence of trades described by the intertrade times 
can be generated by the conditional probability 

(f{Tk\n) = n exp{-nTk) . (3) 



In the case of single exponent power law model, see when PDF of n 

is P(n) ~ exp{ — {no/n)^}, the distribution P{Tk) of intertrade time in 
A;-space has the integral form 



C I exp[ 





-nTk)n^ '*'exp 



n 



dn. 



(4) 



with C defined from the normalization, P{Tk)dTk = 1. The explicit expres- 
sions of the integral (jlj) are available for the integer values of m. When m = 1, 
PDF (jlj) is expressed through the Bessel function of the second kind whereas 
for m > 1 the more complicated structures of distribution P{Tk) expressed in 
terms of hypergeometric functions arise. 

Now we have the complete set of equations defining the stochastic model of the 
trading activity in the financial markets. We proposed this model following our 
increasing interest in the stochastic fractal point processes [T^16|17P5] . Our 
objective to reproduce in details statistics of trading activity conditions rather 
complicated form of the SDE ([1]) and low expectation of analytical results. In 
this paper we focus on the numerical analysis and direct comparison of the 
model with the empirical data. In order to achieve more general description 
of statistics for different stocks we introduce the scaling to Eq. ([T]) with scaled 
time ts = cr'^riQi, scaled rate x = n/riQ and e' = eriQ. Then Eq. ([1]) becomes 



dx 



m 

7a) + —X 



-dWs. (5) 



{xe' + 1)2 {xe' + 1) 



The parameter tlq specific for various stocks is now excluded from the SDE 
and we have only three parameters to define from the empirical data of trad- 
ing activity in the financial markets. We solve Eq. ([5]) using the method of 
discretization. Introducing the variable step of integration Atg = = k^/x^, 
the differential equation ([5]) transforms to the difference equation 
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2 / 

^fc+i = tk + 



m 



(1 - 7.) + yx- 



{xue' + 1)2 (xfce' + 1) 



(6) 
(7) 



with K <^ 1 being a small parameter and Sk defining Gaussian noise with zero 
mean and unit variance . 

With the change of variables r = 1/n one can transform Eq. ([T]) into 

1 



dr = a2 



m 













-dt + a- 



t + T 



with limiting time Tq = l/nQ. We will show that this form of driving SDE is 
more suitable for the numerical analysis. First of all, the powers of variables 
in this equation are lower and the main advantage is that the Poissonian-like 
process can be included into the procedure of numerical solution of SDE. We 
introduce a scaling of Eq. (IHl) with the nondimensional scaled time ts = t/ro, 
scaled intertrade time y = t/tq and e' = e/ro. Then Eq. (IHl) becomes 



m ^ 



a 



roe' + y 



(9) 



As in the real discrete market trading we can choose the instantaneous in- 
tertrade time yk as a step of numerical calculations, hk = yk, or even more 
precisely as the random variables with the exponential distribution P{hk) = 
l/ykexp{—hk/yk)- Then we get the iterative equation resembling tick by tick 
trades in the financial markets, 



Vk+i = yk + 



a 

^0 



^ m 



hi 



, (y Vvkhk 

H — £k- 



(e' + ?/fc)2 Toe' + yk 



(10) 



In this numerical procedure the sequence of 1/yk gives the modulating rate 
and the sequence of hk is the Poissonian-like intertrade times. Seeking higher 
precision one can use the Milshtein approximation instead of Eq. ffTOj) . 



3 Analysis of empirical stock trading data 



We will analyze the tick by tick trades of 26 stocks on NYSE traded for 27 
months from January, 2005. An example of the empirical histograms of and 
N{t, Td) and power spectrum of IBM trade sequence are shown on figure 1. 
We will adjust the parameters of the Poissonian-like process driven by SDE 
Eq. dl]) or Eq. (fTUjl to reproduce numerically the empirical trading statistics. 
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P(i:> P(N) 
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Fig. 1. Trading statistics of IBM stocks, a) Histogram -P(t) of the intertrade time 
Tk sequence; b) Histogram of trading activity P{N) calculated in the time interval 
Trf = 10 min; c) Power spectral density S{f) of the sequence of trades, straight lines 
approximate power spectrum S{f) 1//^^'^ with Pi = 0.33 and (^2 = 0.94. 



The histograms and power spectrum of the sequences of trades for all 26 
stocks are similar to IBM shown on Fig. [H From the histogram P{Tk) we 
define a model parameter tq for every stock. One can define the exponent A' 
from the power-law tail of the histogram P{N) ~ A^~^ . The power spectrum 
exhibits two scaling exponents f3i and when approximated by power-law 
S{f) ~ 1//^^'^ The empirical values of P2, tq and A' for = 10 min are 
presented in Tabled! 

Values of Pi and P2 fluctuate around 0.3 and 0.9, respectively, as in the separate 
stochastic model realizations. The crossover frequency fc of two power-laws 
exhibits some fluctuations around the value fc ~ 10~^ Hz as well. One can 
observe considerable fluctuations of the exponent A' around the mean value 
4.4. Notice that value of exponent A' for integrated trading activity is higher 
than for n. Our analysis shows that the explicit form of the P{Tk), Eq. (jl]) with 
m = 2 and A = 2.7, fits empirical histogram of r^, for all stocks very well and 
fitting parameter tq can be defined for every stock. Values of tq are presented 
on Table [TJ From the point of view of the proposed model the parameter 
To is specific for every stock and reflects the average trading intensity in the 
calm periods of stock exchange. We eliminate these specific differences in our 
model by scaling transform of Eq. arriving to the nondimensional SDE 
and its iterative form (ITOl) . These equations and parameters a' = ct/tq, 70-, 
e' and m = 2 define our model, which has to reproduce in details power-law 
statistics of the trading activity in the financial markets. From the analysis 
based on the research of fractal stochastic point processes |13fl4lll5fltifl7lll8] 



5 



Table 1 



The empirical values of tq, Pi, P2 and A'. 



Stock 


To 


Pi 


P2 


A' 


Stock 


To 


Pi 


P2 


A' 


ABT 


7 


0.27 


0.92 


5.8 


JNJ 


4 


0.31 


0.87 


4.4 


ADM 


8 


0.29 


0.96 


3.5 


JPM 


4.5 


0.3 


0.8 


5.7 


BA 


8 


0.32 


0.84 


4.5 


KO 


6 


0.29 


0.85 


6.5 


BMY 


7 


0.3 


0.9 


4.4 


LLY 


8 


0.32 


0.84 


4.5 


C 


3 


0.31 


0.87 


4.6 


MMM 


8 


0.26 


0.96 


4.8 


CVX 


4 


0.3 


0.87 


6.4 


MO 


5 


0.36 


0.9 


3.6 


DOW 


7 


0.32 


0.88 


5.7 


MOT 


4 


0.29 


0.93 


3.4 


FNM 


9 


0.36 


0.95 


3.4 


MRK 


5 


0.35 


0.82 


3 


FON 


8 


0.29 


0.97 


3.4 


SLE 


10 


0.2 


0.75 


4.5 


GE 


2.25 


0.27 


0.87 


4.7 


PEE 


2 


0.31 


0.96 


3.6 


GM 


6 


0.36 


0.93 


2.7 


T 


10 


0.28 


0.88 


3.7 


HD 


4 


0.32 


0.95 


5.4 


WMT 


4 


0.31 


0.84 


4.7 


IBM 


5 


0.3 


0.87 


4.1 


XOM 


3 


0.34 


0.9 


4.4 


Average 


5.8 


0.305 


0.89 


4.4 
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Eig. 2. Distribution of the Poissonian-like interevent time = hk, (a), and power 
spectrum, (b), of the sequence of point events calculated from Eq. pOj) with the 
adjusted parameters a' = 0.006, 70- = 0.64, e' = 0.05. 

and by fitting the numerical calculations to the empirical data we arrive at 
the collection of parameters a' = 0.006, 7o- = 0.64, e' = 0.05. In figure [2] we 
present histogram of the sequence of Tk = h^, (a), and the power spectrum of 
the sequence of trades as point events, (b), generated numerically by Eq. ffTOl) 
with the adjusted parameters. 

For every selected stock one can easily scale the model sequence of intertrade 
times Tfc = hk by empirically defined tq to get the model sequence of trades for 
this stock. One can scale the model power spectrum S{f) by I/tq for getting 
the model power spectrum Sstock{f) for the selected stock Sstock{f) = S{f)/TQ. 
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Fig. 3. Distribution P{t) of intertrade time r for IBM and MMM stocks; empirical 
histogram, tick gray line, modeled Poissonian-like distribution, solid line, distribu- 
tion of driving r = in Eq. p^ . dashed line. 
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Fig. 4. Modeled, smooth curves, and empirical, sharp curves, power spectra of trad- 
ing activity for IBM and MMM stocks. 
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Fig. 5. Modeled, thin line, and empirical, tick line, trading activities for IBM and 
MMM stocks in the time interval = 300s. 



We proposed the iterative Eq. ffTOj) as quite accurate stochastic model of trad- 
ing activity in the financial markets. Nevertheless, one has to admit that real 
trading activity often has considerable trend as number of shares traded and 
the whole activity of the markets increases. This might have considerable in- 
fluence on the empirical long range distributions and power spectrum of the 
stocks in consideration. The trend has to be eliminated from the empirical 
data for the detailed comparison with the model. Only few stocks from the 
selected list in Table [T]have stable trading activity in the considered period. In 
figures [31 IH and we provide the comparison of the model with the empirical 
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data of those stocks. As we illustrate in figure [3l the model Poissonian-like dis- 
tribution can be easily adjusted to the empirical histogram of intertrade time, 
with To = 5 s for IBM trade sequence and with tq = 8 s for MMM trading. The 
comparison with the empirical data is limited by the available accuracy, 1 s, of 
stock trading time tk- The probability distribution of driving t = yu Eq. ffTOj) . 
dashed line, illustrates different market behavior in the periods of the low and 
high trading activity. The Poissonian nature of the stochastic point process 
hides these differences by considerable smoothing of the PDF. Figure H] illus- 
trates that the long range memory properties of the trading activity reflected 
in the power spectrum are universal and arise from the scaled driving SDK 
([5]) and ([9]). One can get power spectrum of the selected stock trade sequence 
scaling model spectrum, figure [2] (b), with I/tq. The PDF of integrated trad- 
ing activity N is more sensitive to the market fluctuations. Nevertheless, as 
we demonstrate in figure the model is able to reproduce the power-law tails 
very well. 



4 Conclusions 



We proposed the generalization of the point process model as the Poissonian- 
like sequence with slowly diffusing mean interevent time [13] and adjusted the 
parameters of the model to the empirical data of trading activity in the finan- 
cial markets. A new form of scaled equations provides the universal description 
with the same parameters applicable for all stocks. The proposed new form 
of the continuous stochastic differential equation enabled us to reproduce the 
main statistical properties of the trading activity and waiting time, observable 
in the financial markets. In proposed model the fractured power-law distribu- 
tion of spectral density with two different exponents arise. This is in agreement 
with the empirical power spectrum of the trading activity and volatility and 
implies that the market behavior may be dependent on the level of activity. 
One can observe at least two stages in market behavior: calm and excited. 
Ability to reproduce empirical PDF of intertrade time and trading activity 
as well as the power spectrum in every detail for various stocks provides a 
background for further stochastic modeling of volatility. 
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